New Features for Language Identification Using Gmm
نویسندگان
چکیده
Automatic Language Identification (LID) is the process of identifying the language spoken within an utterance. The challenge that this task presents is that no prior information is available indicating the content of the utterance or the identity of the language. Most of the existing LID systems are based on MFCC feature vectors. This paper introduces the use of new feature extraction approach for LID task. For this, an approach is proposed to derive a new type of feature vectors from speech signal alone. The feature extraction method is based on the frequency of occurrence of phonemes is different for different languages. The variations in frequency of occurrence of phonemes are effectively captured using Gaussians. These variations are captures in the form of probability vectors using Gaussians. This approach outperformed the existing conventional MFCC feature vector based LID systems.
منابع مشابه
مقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملدر کاربرد تشخیص زبان گفتاری GMM-VSM در قالب سیستم GMM
GMM is one of the most successful models in the field of automatic language identification. In this paper we have proposed a new model named adapted weight GMM (AW-GMM). This model is similar to GMM but the weights are determined using GMM-VSM LID system based on the power of each component in discriminating one language from the others. Also considering the computational complexity of GMM-VSM,...
متن کاملAn Automatic Language Identification Using Audio Features
An automatic Language Identification (LID) is the task of automatically recognizing a language from the given spoken utterance. Language identification is used to identify the language of the particular audio and reduce the complexity of the audio sample. LID systems that rely on multiple language phone recognition language modeling (PRLM) and n-gram language modeling produces the best performa...
متن کاملUsing i-Vector Space Model for Emotion Recognition
Using i-vector space features has been shown to be very successful in speaker and language identification. In this paper, we evaluate using the i-vector framework for emotion recognition from speech. Instead of using standard i-vector features, we propose to use concatenated emotion specific i-vector features. For each emotion category, a GMM supervector is generated via adaptation of the neura...
متن کاملSpoken Language Identification Using Hybrid Feature Extraction Methods
This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) sys tem. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...
متن کاملSpeaker vectors from subspace Gaussian mixture model as complementary features for language identification
In this paper, we explore new high-level features for language identification. The recently introduced Subspace Gaussian Mixture Models (SGMM) provide an elegant and efficient way for GMM acoustic modelling, with mean supervectors represented in a low-dimensional representative subspace. SGMMs also provide an efficient way of speaker adaptation by means of lowdimensional vectors. In our framewo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012